Paradigm gaps are associated with weird “distributional semantics” properties


Abstract

This study investigates the phenomenon of defectiveness in Russian case and number noun paradigms from the perspective of distributional semantics. We made use of word embeddings, high-dimensional vectors trained on large text corpora, and compared the observed paradigms of nouns that are defective in the genitive plural, as suggested by Zaliznjak (1977), with those of non-defective nouns. When the embeddings of about 20,000 inflected forms were projected onto a two-dimensional space, clusters were found within this space, suggesting global semantic similarity among words with the same inflectional features. Moreover, defective lexemes were characterized by lower semantic transparency: the inflected forms of a defective lexeme are semantically less similar to each other, and their meanings are also more idiosyncratic. Furthermore, the inflected forms of defective lexemes lie further away from the idealized average case-number meanings, obtained by averaging over all forms with the same case-number combination. As a consequence, the semantics of defective forms are predicted less precisely by a simple model of conceptualization that assumes the meaning of a given inflected form is approximated well by the sum of the embeddings of the pertinent lexeme, case, and number. We conclude that the relationship between defectiveness and semantics, at least the kind of semantics captured by distributional semantics, is stronger than has been anticipated previously.
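The two quantities the abstract relies on, an idealized case-number meaning obtained by averaging and an additive prediction of a form's meaning, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy three-dimensional vectors, the placeholder lexeme labels, and the choice of cosine similarity as the closeness measure are all assumptions made for illustration only.

```python
import math
from collections import defaultdict


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def case_number_centroids(forms):
    """Average the embeddings of all forms sharing a case-number combination.

    `forms` maps (lexeme, case, number) to an embedding (hypothetical layout).
    Returns a dict mapping (case, number) to the averaged vector.
    """
    groups = defaultdict(list)
    for (_lexeme, case, number), vector in forms.items():
        groups[(case, number)].append(vector)
    return {key: mean_vector(vectors) for key, vectors in groups.items()}


def additive_prediction(lexeme_vec, case_vec, number_vec):
    """Approximate a form's meaning as lexeme + case + number vectors."""
    return [l + c + n for l, c, n in zip(lexeme_vec, case_vec, number_vec)]


# Invented toy embeddings; real embeddings would have hundreds of dimensions.
TOY_FORMS = {
    ("lexeme_a", "gen", "pl"): [1.0, 0.0, 0.0],
    ("lexeme_b", "gen", "pl"): [0.0, 1.0, 0.0],
    ("lexeme_a", "nom", "sg"): [0.0, 0.0, 2.0],
}

centroids = case_number_centroids(TOY_FORMS)
# Closeness of one genitive plural form to the averaged genitive plural meaning.
closeness = cosine(TOY_FORMS[("lexeme_a", "gen", "pl")], centroids[("gen", "pl")])
```

Under these assumptions, a lower `closeness` value for a form would correspond to what the abstract describes as lying further from the idealized case-number meaning, and a larger gap between `additive_prediction(...)` and the observed embedding would correspond to a less precise prediction.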


Similar resources

Multimodal Distributional Semantics

Distributional semantic models derive computational representations of word meaning from the patterns of co-occurrence of words in text. Such models have been a success story of computational linguistics, being able to provide reliable estimates of semantic relatedness for the many semantic tasks requiring them. However, distributional models extract meaning information exclusively from text, w...


Functional Distributional Semantics

Vector space models have become popular in distributional semantics, despite the challenges they face in capturing various semantic phenomena. We propose a novel probabilistic framework which draws on both formal semantics and recent advances in machine learning. In particular, we separate predicates from the entities they refer to, allowing us to perform Bayesian inference based on logical for...


Distributional Semantics in Technicolor

Our research aims at building computational models of word meaning that are perceptually grounded. Using computer vision techniques, we build visual and multimodal distributional models and compare them to standard textual models. Our results show that, while visual models with state-of-the-art computer vision techniques perform worse than textual models in general tasks (accounting for semanti...


Distributional Semantics in Use

In this position paper we argue that an adequate semantic model must account for language in use, taking into account how discourse context affects the meaning of words and larger linguistic units. Distributional semantic models are very attractive models of meaning mainly because they capture conceptual aspects and are automatically induced from natural language data. However, they need to be ...


Journal

Journal title: The Mental Lexicon

Year: 2023

ISSN: 1871-1340, 1871-1375

DOI: https://doi.org/10.1075/ml.22013.chu